LISTING OF CLAIMS 



1. (currently amended) A method for a computer network user 
for creating a voice XML file automatically, comprising: 

providing a graphic user interface (GUI) for defining 
a plurality of icons, each of said icons corresponding to 
one or more attributes of voice XML; 

receiving user selection input of said icons to edit 
content displayed in said GUI ; 

recording an action stream of a user invoking said 
icons in the graphic user interface; and 

interpreting said action stream based on a library of 
voice XML tags to create the voice XML file. 

2. (original) The method according to claim 1, 
characterized in that said graphic user interface comprises 
a graphic user interface for adding one or more audio 
hyperlinks for a voice XML file automatically, wherein each 
icon, defined in said graphic user interface, corresponds 
to a kind of hyperlink. 

3. (original) The method according to claim 2, 
wherein said adding hyperlinks comprises adding the 
hyperlinks to a TTS voice XML file, and wherein said adding 
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comprises the user editing the TTS voice XML file in the 
edit area of said graphic user interface, marking or 
entering the parts to be added with the hyperlinks, 
invoking the corresponding icons and entering the 
corresponding hyperlink addresses. 

4. (original) The method according to claim 2, wherein 
said adding hyperlinks comprises adding the hyperlinks to a 
real-time-recorded audio voice XML stream, and wherein said 
adding comprises the user editing the TTS voice XML file in 
the edit area of said graphic user interface, marking or 
entering the parts to be added with the hyperlinks, 
invoking the corresponding icons and entering the 
corresponding hyperlink addresses, and wherein speech 
recognition technology is applied to find the parts in the 
real-time-recorded audio voice XML stream that match the 
parts entered by the user when interpreting said action 
stream based on a library of voice XML tags. 

5. (original) The method according to claim 3, 
characterized in that when the user marks or enters the 
same parts to be added with the hyperlinks in the edit area 
of the graphic user interface for many times and invokes 
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the same hyperlink attributes, the hyperlinks for the whole 
TTS voice XML stream are batch-added. 

6. (original) The method according to claim 4, 
characterized in that when user marks or enters the same 
parts to be added with the hyperlinks in the edit area of 
the graphic user interface for many times and invokes the 
same hyperlink attributes, the hyperlinks for the whole 
real-time-recorded audio voice XML stream are batch-added. 

7. (currently amended) A system for creating voice XML 
file automatically, comprising: 

a graphic user interface for defining a plurality of 
icons based on network user input, wherein each of said 
icons corresponds to one or more attributes of voice XML, 
and for receiving user input to edit content displayed in 
said GUI ; 

a voice XML tag generator for interpreting said action 
stream based on a library of voice XML tags and generating 
the corresponding voice XML tags; and 

a voice XML file generator for creating the voice XML 
file by combining the contents to be played with the tags 
generated by the voice XML tag generator according to voice 
XML syntax. 
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8. (original) A system according to claim 7, characterized 
in that said graphic user interface comprise a graphic user 
interface component for adding audio hyperlinks for 
VoiceXML file automatically, wherein each icon, defined in 
said graphic user interface, corresponds to a kind of 
hyperlink . 

9. (original) A system according to claim 8, wherein said 
adding the hyperlinks comprises adding hyperlinks for a TTS 
voice XML stream, and wherein said adding comprises the 
user editing the TTS voice XML file in the edit area of 
said graphic user interface, marking or typing the parts to 
be added the hyperlinks, invoking the corresponding icons 
and typing the corresponding hyperlink addresses. 

10. (original) A system according to claim 8, wherein said 
adding comprises adding the hyperlinks for real-time 
recorded audio voice XML stream and wherein said system 
further comprises a speech recognition engine, said adding 
comprising the user editing the TTS voice XML file in the 
edit area of said graphic user interface, marking or typing 
the parts to be added the hyperlinks, invoking the 
corresponding icons and typing the corresponding hyperlink 
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addresses, and wherein said interpreting said action stream 
based on a library of voice XML tags further comprises said 
speech recognition engine finding the parts in the real- 
time-recorded audio Voice XML stream that match the parts 
entered by the user. 

11. (original) A system according to claim 9, 
characterized in that when the user marks or enters the 
same parts to be added as hyperlinks in the edit area of 
the graphic user interface component for many times, and 
invokes the same hyperlinking attributes, said component 
adds the hyperlinks for the whole TTS voice XML stream. 

12. (original) A system according to claim 10, 
characterized in that when user marks or enters the same 
parts to be added as hyperlinks in the edit area of the 
graphic user interface component for many times, and 
invokes the same hyperlinking attributes, said component 
adds the hyperlinks for the whole real-time-recorded audio 
voice XML stream. 

13. (currently amended) A program storage device readable 
by machine tangibly embodying a program of instructions 
executable by said machine to perform method steps for 
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creating a voice XML file automatically, said method 
comprising the steps of: 

providing a graphic user interface for defining a 
plurality of icons based on network user input, each of 
said icons corresponds to one or more attributes of voice 
XML; 

receiving user selection input of said icons to edit 
content displayed in said GUI ; 

recording an action stream of a user invoking said 
icons in the graphic user interface; and 

interpreting said action stream based on a library of 
voice XML tags to create the voice XML file. 

14. (original) The program storage device according to 
claim 13 wherein said method is characterized in that said 
graphic user interface comprises a graphic user interface 
for adding one or more audio hyperlinks for a voice XML 
file automatically, and wherein each icon, defined in said 
graphic user interface, corresponds to a kind of hyperlink. 

15. (original) The program storage device according to 
claim 14, wherein said adding hyperlinks comprises adding 
the hyperlinks to a TTS voice XML file, and wherein said 
adding comprises the user editing the TTS voice XML file in 
the edit area of said graphic user interface, marking or 
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entering the parts to be added with the hyperlinks, 
invoking the corresponding icons and entering the 
corresponding hyperlink addresses. 

16. (original) The program storage device according 
to claim 14, wherein said adding hyperlinks comprises 
adding the hyperlinks to a real-time-recorded audio voice 
XML stream, and wherein said adding comprises the user 
editing the TTS voice XML file in the edit area of said 
graphic user interface, marking or entering the parts to be 
added with the hyperlinks, invoking the corresponding icons 
and entering the corresponding hyperlink addresses, and 
wherein speech recognition technology is applied to find 
the parts in the real-time-recorded audio voice XML stream 
that match the parts entered by the user when interpreting 
said action stream based on a library of voice XML tags. 

17. (original) The program storage device according to 
claim 15, characterized in that when the user marks or 
enters the same parts to be added with the hyperlinks in 
the edit area of the graphic user interface for many times 
and invokes the same hyperlink attributes, the hyperlinks 
for the whole TTS voice XML stream are batch-added. 
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18. (original) The program storage device according to 
claim 16, characterized in that when user marks or enters 
the same parts to be added with the hyperlinks in the edit 
area of the graphic user interface for many times and 
invokes the same hyperlink attributes, the hyperlinks for 
the whole real-time-recorded audio voice XML stream are 
batch-added. 
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